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(57) Abstract 

There is diclosed a PHA biosynthesis-related DNA fragment, which comprises the genes for PHA synthase, /?-ketothiolase and 
acetoacetyl-CoA reductase, which are all derived from Alcaligenes latus. The DNA fragment is inserted in an expression vector. E. coli 
which is transformed with the expression vector carrying the DNA fragment can produce the PHA biosynthesis-related enzymes as well as 
accumulate PHA at a large quantity by culturing it in one-step. 
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POLYHYDROXYALKANOATE BIOSYNTHESIS-RELATED GENES 
DERIVED FROM Alcaligenes latus 

BACKGROUND OF THE INVENTION 

5 

Field of the invention 

The present invention relates to polyhydroxyalkanoate (hereinafter 
referred to as "PHA") biosynthesis-related genes for PHA synthase, p- 
ketothiolase and acetoacetyl-CoA reductase, derived from Alcaligenes latus, 
10 their amino acid sequences, a recombinant plasmid carrying these genes, and a 
method for massproducing PHA using these gene. Also, the present invention 
relates to polyhydroxybutyrate(hereinafter referred to as "PHB") gene derived 
from Alcaligenes latus, its amino acid sequence and a recombinant plasmid 
carrying PHB gene, and a method for mass-producing PHB using the gene. 

15 

Description of the Prior Art 

Petroleum synthetic plastics are so durable that they are not degraded in 
usual conditions at all. Because the production amount of the petroleum 
synthetic plastics increases each year, the environmental pollution ascribed to 
20 petroleum synthetic plastics wastes are now a big social problem. To solve the 
problem of non-degradable plastics, active research and development efforts 
have been and continued to be directed to biodegradable polymers all over the 
world. 

Biodegradable polymers are the high molecular weight materials that are 
25 completely degraded under natural conditions after a period of time. Many 
biodegradable polymers have been developed. Of them, PHA, a natural 
polyester which is synthesized and accumulated by microorganisms, is of 
particular interest because it is superior in biodegradability as well as shows 
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physical properties similar to those of the synthetic plastics in current use 
(Anderson A.J. and Dawes, E.A., Microbiol. Rev., 1990, 54, 450-472; Lee, 
S.Y., Biotechnol. Bioeng, 49:1-14,1996; Lee, S.Y., Trends Biotechnol., 
14:431-438, 1996). 

In detail, PHA is an organic reserve material, which can provide an 
intracellular store of carbon or energy, usually found in Pseudomonas, 
Alcaligenes, Azotobacter, and Bacillus spp.,etc. It is detectable as granular 
cytoplasmic inclusions. As a general rule, the cellular content of the reserve 
material is relatively low in actively growing cells: They accumulate massively 
when cells are limited in nitrogen, phosphorous, sulfur, oxygen, etc., but still 
have carbon and energy available. This reserve material was first found in 
Bacillus megaterium by Lemoigne in 1925 (Lemoigne, M., Bull. Soc. Chem. 
Biol., 8:770-782, 1926). Since then, its chemical and physical properties have 
been extensively researched. Poly(3-hydroxybutyrate) is the most widely and 
first known PHA. 

According to the number of carbon atoms and the substituents in 
hydroxyalkanoate, many PHAs were reported. In general, PHAs are divided 
into two classes ; short-chain-length PHAs(SCL PHAs) and medium-chain- 
length PHAs(MCL PHAs) 

SCL PHAs include poly-(3-hydroxypropionic acid, poly-P- 
hydroxybutyric acid, and poly-[3-hydroxyvaleric acid, which are produced by 
Alcaligenes eutrophus, Azotobacter vinelandii, methylotrophs, etc. SCL PHAs 
are widely used due to their similar properties to polypropylene, a kind of 
chemically synthesized plastics. 

MCL PHAs, composed of 3 to 9 more carbon atoms than SCL PHAs, 
are produced by Pseudomonas spp., by using alkane, 1-alkene, C 6 ~ C 12 
alkanoic acids as a carbon. 

2 
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Since early the 1960s, it was recognized that PHA could work like 
thermoplastic polymers. Thereafter, attracting a great attention, many types of 
PHA copolymers were synthesized, which are superior in mechanical 
properties as well as in biodegradability. By virtue of these advantages and 
5 owing to the environmental pollution aggravated by petroleum synthetic 
polymer wastes, PHA is now actively researched and developed as an 
alternative for plastics over the world. In addition, biocompatibility and 
bioabsorptivity allow PHA to be used in a variety of fields, as materials for 
agriculture, medicinal care, drug transfer system, and package, and as 

10 precursors for fine chemical products (Holmes, P. A. in Developments in 
crystalline polymers. 1-65, 1988). 

Taking advantage of various bacteria, molecular biological research has 
revealed that there are four different biosynthetic pathway for PHA 
(Steinbuchel, A. in Biomaterials: novel materials from biological sources, 215- 

15 262, 1991). For example, for Alcaligenes eutrophus, the most widely known 
bacteria, p-ketothiolase, acetoacetyl-CoA reductase and polyhydroxyalkanoate 
synthase (PHA synthase) are known to be involved in the biosynthesis of PHA 
(People, O.P. and Shinskey, A.J., J. Biol. Chem., 264: 15298-15303, 1989; 
Schubert, P., Steinbuchel, A. and Schlegel, H.G., J. Bacteriol., 170:5837-5847, 

20 1 988; Slater, S.C., Voige, W.H. and Dennis, D.E., J. Bacteriol., 170:4431- 
4436, 1988). 

A concrete biosynthetic pathway of PHA in Alcaligenes eutrophus, 
gram negative bacteria, is as follows. Between two molecules of acetyl-CoA, a 
carbon-carbon bond forms in the presence of p-ketothiolase, the product of 
25 gene phbA , according to a biological Claisen condensation. The acetoacetyl- 
CoA thus formed is converted into D(-)-p-hydroxybutyryl-CoA by the 
stereoselective reduction of NADPH-dependent acetoacetyl-CoA reductase, the 
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product of gene phbB. Finally, D(-)-p-hydroxybutyryl-CoA is polymerized via 
ester bond by PHA synthase, the product of gene phbC. 

In order to clone the genes which pertain to the biosynthesis of PHA in 
other bacteria than Alcaligenes eutrophus, much effort has been made. That is, 
5 the comprehension of the biosynthesis of PHA in bacteria makes it possible 
efficient production of PHA, versatility of substrates, synthesis of new PHA, 
and development of biopolymers similar to PHA. Further, recombinant strains 
which are obtained by utilizing the PHA biosynthesis-related genes can 
synthesize various PHAs at high efficiencies, resulting in a scientific and 

10 industrial significance (Lee, S.Y., Trends Biotechnol, 14:431-438, 1996). 

Strain Alcaligenes latus is reported to be so superior in the production of 
PHA that it accumulates PHA in cells at a proportion of around 90%. Also, 
Alcaligenes latus has the advantage in that it grows fast and uses inexpensive 
substrates as carbon sources (Wang, F. and Lee, S.Y., Appl. Environ. 

15 Microbiol., 63:3703-3706, 1997). Unlike Alcaligenes eutrophus, Alcaligenes 
latus accumulates PHAs while they are growing. Thus, Alcaligenes latus can 
mass-produce PHA by one-step culture although the amount is low relative to 
that upon Alcaligenes eutrophus. 

The use of Alcaligenes latus to produce PHA began in earnest in the 

20 mid-1980s by Chemie Linz AG, Austria. Biotechnologishe forchungesellschaft 
mbH, Austria, developed a process in which a one-step culture of strain btF-96, 
a mutant strain of Alcaligenes latus., produces PHA, asserting that one ton of 
PHA is obtained from a 15 m 3 fermentor per week (Hrabak, O., FEMS 
Microbiol. Rev., 103:251-256, 1992). Alcaligenes latus also produces poly(3- 

25 hydroxybutyrate/3-hydroxypropionate) as well as poly(3-hydroxybutyrate/4- 
hydroxypropionate) in a medium containing disaccharides as carbon source by 
addition of 3-hydroxypropionate and y-butyrolactone (Hiramitsu, M., Koyama, 
N., and Doi, Y., Biotechnol. Lett., 15:461-464, 1993). 
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PHA can be produced by chemical process as well as biological process. 
However, Commercially favorable production scale of PHA is possible only by 
biological process. Since the production cost of PHA is much higher than those 
of other commercially available synthetic polymers, new technologies are 
required to reduce the production cost of PHA. Particularly, recombinant DNA 
technology gives a great contribution to the development and modification of 
novel strains, showing the production of novel polymers, utility of low-priced 
substrate, high efficiency of production, and facility in separation and 
purification. In order to develop such recombinant strains, first of all, it is 
necessary to understand the enzymes involved in the biosynthetic pathway for 
PHA. 

In order to mass-produce biodegradable, natural PHA and its 
copolymers, the inventors have cloned genes for polyhydroxyalkanoate 
synthase, P-ketothiolase, and acetoacetyl-CoA reductase, and determined 
amino acid sequences and gene sequences. They have made expression vectors 
carrying the above genes and transformants, whereby polyhydroxyalkanoate 
can be produced and accumulated. 

In addition, the inventors have cloned gene for polyhydroxybutyrate 
(PHB) and determined gene sequence and amino acid sequence, and made 
expression vector carrying the PHB gene and transformant, whereby 
polyhydroxybutyrate can be produced and accumulated. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a photograph showing opaque colonies of recombinant E. coli 
containing PHA biosynthesis-related genes derived from Alcaligenes latus, 
formed on a solid medium. 
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Fig. 2 is a photograph showing that recombinant E. coli containing PHA 
biosynthesis-related genes accumulates PHA in a broth. 

Fig. 3 is a base sequence 6.4 kb in size, which contains the whole PHA 
biosynthesis-related genes derived from A Icaligenes latus. 

Fig. 4 shows a restriction enzyme map of a 6.4 kb DNA fragment 
containing PHA biosynthesis-related genes derived from Alcaligenes latus, 
along with a gene structure. 

Fig. 5 shows the gene structure of recombinant expression vector pJCl 
carrying PHA biosynthesis-related genes derived from Alcaligenes latus. 

Fig. 6 shows the process of preparing the recombinant expression vector 
carrying PHB synthase gene derived from Alcaligenes latus. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a polyhydroxyalkanoate biosynthesis- 
related gene. 

The present invention provides an expression vector containing the 
polyhydroxyalkanoate biosynthesis- related gene and its transformant. 

The present invention provides the method of preparing the 
polyhydroxyalkanoate synthase. 

The present invention provides the method of preparing the 
polyhydroxyalkanoate. 

In addition, the present invention provides a polyhydroxybutyrate gene. 

The present invention provides an expression vector containing the 
polyhydroxybutyrate gene and its transformant. 

The present invention provides the method of preparing the 
polyhydroxybutyrate synthase. 

6 
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The present invention provides the method of preparing the 
polyhydroxybutyrate. 

In the present invention, genes for the biosynthesis of PHA, are 
separated from Alcaligenes latus, which accumulates PHA while growing, 
whereby biodegradable, natural and industrially useful PHA and its copolymers 
can be mass-produced. 

In more detail, the total genomic DNA separated from Alcaligenes latus 
is partly digested by restriction enzymes and the resulting DNA fragments are 
inserted into vector pUC19. E. coli is transformed with vector pUC19, 
followed by the selection of the recombinant vectors with a PHA biosynthesis- 
related DNA. The bacteria harboring the interest DNA was observed to 
accumulate PHA on a solid medium and in a liquid medium, as shown in Figs. 
1 and 2, respectively. 

Isolation of the recombinant vector from the transformed bacteria 
capable of producing PHA, is the first thing necessary to identify the DNA 
fragment of interest. Various analytic works show that the DNA fragment of 
interest is 6.4 kb in size, containing the genes coding for all of the p- 
ketothiolase, acetoacetyl-CoA reductase and PHA synthase. 

Therefore, in accordance with an aspect, the present invention pertains 
to a PHA biosynthesis-related DNA fragment containing a PHA synthase gene, 
a P-ketothiolase gene and an acetoacetyl-CoA reductase gene, in due order, 
which has a size of 1608 bp (corresponding to 536 aa), 1176 bp (392 aa) and 
735 bp (245 aa), respectively (see, Fig. 4). 

Sequencing analyses reveal that the PHA synthase gene has a base 
sequence of Sequence 2 with a corresponding amino acid sequence of Sequence 
5, as suggested in the accompanying Sequence Lists. The p-ketothiolase gene 
has a base sequence of Sequence 3 and the p-ketothiolase expressed therefrom 
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has an amino acid sequence of Sequence 6. The analyses also give that the 
acetoacetyl-CoA reductase gene has a base sequence of Sequence 4 which 
corresponds to an amino acid sequence of Sequence 7(see, Fig. 3 and Sequence 
Lists). 

5 The recombinant vector anchoring the DNA for biosynthesis of PHA 

was named pJCl (see, Fig. 5) and the transformant, E. coli XL-1 Blue/pJCl, 
was deposited in Korean Collection for Type Cultures, Korean Research 
Institute of Bioscience and Biotechnology on Nov. 5, 1997 and received a 
Deposition No. KCTC 0398 BP. 

i o In accordance with another aspect, the present invention pertains to the 

preparation of the PHA biosynthesis-related enzymes by culturing host bacteria 
which harbor a recombinant expression vector containing the PHA 
biosynthesis-related genes. 

In accordance with a further aspect, the present invention pertains to the 

1 5 production of PHA and its copolymers by use of the above host bacteria which 
can express the PHA biosynthesis-related genes. To this end, E. coli was 
transformed by the recombinant expression vector and after selecting, the 
transformed E. coli was cultured in a liquid medium containing glucose in 
suitable concentration to produce PHA. Where the E. coli was cultured in this 

20 manner, PHA was observed to accumulate until it represent as much as 40 % or 
more of the dry cell weight. 

In addition, this invention provides polyhydroxybutyrate synthase 
(hereinafter referred to as "PHB synthase") and genes thereof. The total 
genomic DNA separated from Alcaligenes latus is partly digested by restriction 

25 enzyme, followed by selecting the DNA fragment showing positive signal by 
use of PHB gene derived from Alcaligenes eutrophus H16 as a probe. Plasmid 
vector pAL32 is obtained by inserting the above PHB gene into pSK(+). 

8 
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The pAL32 is digested with EcoRl and Notl to obtain the PHB gene and 
then the resulting gene is inserted into plasmid pK230 of broad host range to 
obtain the recombinant expression vector pKTC32. This pKTC32 can express 
the gene in various host cells. (see Fig. 6) 
5 The transformant Alcaligenes eutrophus LAR5 obtained by inserting 

pKTC32 into Alcaligenes eutrophus DSM541 which is lacking in PHB gene, 
was deposited in Korean Collection for Type Cultures, Korean Research 
institute of Bioscience and Biotechnology on Nov. 11, 1997, with a deposition 
No. KCTC 0568 BP. 
l o When the above transformant A Icaligenes eutrophus DSM54 1 (phb) 

/pKTC32 is cultured , it is observed that PHB synthase is produced in the cell 
cytoplasm in the form of white particle. 

The invention will now be illustrated by the following examples, but not 
be limited in scope by reason of any of the following examples. 

15 

EXAMPLE I : Separation of Genomic DNA from Alcaligenes latus 

The strain Alcaligenes latus (Wang, F and Lee, S.Y., Appl. Envirn. 
Microbiol., 63 -3101-31 '06, 1997) was cultured overnight in 500 ml of an NB 

20 medium (8 g/L nutrient broth). The bacteria in an initial stage of exponential 
growth were harvested by centrifugation and washed twice with saline-EDTA 
(0.15 M NaCl, 0.1 M EDTA, pH 8.0). The washed bacteria were suspended in 
40 ml of 0.1 M saline-Tris-Cl (0.1 M NaCl, 10 mM EDTA, pH 9.0) and 1 ml of 
lysozyme solution (20 mg/ml) prepared just before use was added to the 

25 suspension. This suspension was dropwise added at 37 °C with Tris-SDS 
buffer (0.4 M NaCl, 1 mM EDTA, 20 mM Tris-Cl, pH 7.5, added with 5% 
SDS) with slow agitation. When the resulting solution became viscous, 5.5 ml 
of Proteinase K (10 mg/ml) was added and the total solution was incubated at 

9 
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37 °C for 2 hours to remove proteins. Next, equal volume of phenol was added 
to the solution and well mixed for 30 min at room temperature with caution. 
After the solution was centrifuged at 6,000 rpm for 10 min, the supernatant was 
transferred to a fresh beaker followed by volume-measurement, and slowly 
added with two times the volume of cold ethanol to precipitate the genomic 
DNA which was, then, rolled up with a glass bar. The DNA was dried at room 
temperature and dissolved in 10 ml of TE buffer (lOmM Tris-HCl, ImM 
EDTA, pH 8.0). Thereafter RNase was added to the above solution until the 
final concentration became 50ug/ml and the total solution was incubated at 
37 C for 1 hour. Then the same following process, i.e. mixing with phenol, 
centrifugation, volume mearsurement, addition of cold ethanol, rolling up, 
drying, and resuspension in TE buffer, was repeated. The only difference was 
that the concentration of TE buffer was 2ml. 

EXAMPLE II : Cloning of PHA Biosynthesis-Related Genes 

The genomic DNA of A lea ligenes latus, obtained Example I, was partly 
digested by restriction enzyme SauiAl. Because restriction enzyme Sau3Al 
recognizes a specific four-base sequence in double-stranded DNA and cleaves 
both strands of the duplex at a specific site, various DNA fragments ranging 
from a small size to a large size can be obtained. These DNA fragments were 
separated according to size by electrophoresis on a low-melting temperature 
agarose gel. 

To obtain the whole PHA biosynthesis-related gene, only the genes 
which were as large as or larger than 4 kb in size, were selected and inserted in 
plasmid pUC19 2.68 kb in size. To this end, first, the plasmid was cut with 
restriction enzyme BamHl which leaves the same end sequence with restriction 
enzyme SauS AI. Then, the genomic DNA fragments at least 4 kb long were 

10 
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ligated with the opened plasmid vector pUC19 by using T4 DNA ligase (New 
England Biolabs). 

The recombinant vector thus obtained was used to transform E. coli 
XL 1 -Blue (Stratagene) with the aid of an electroporator. When the 
recombinant vector pUC19 which contained the whole PHA biosynthesis- 
related gene at a BamHl cloning site was taken up by E. coli XL 1 -Blue, white 
colonies were formed on a solid LB medium (tryptone 10 g/L, yeast extract 5 
g/L, NaCl 10 g/L) supplemented with ampicillin, X-gal (5-bromo-4-chloro-3- 
indolyl-(3~D-galactopyranoside) and IPTG (isopropyl-l-thio-P~D- 
galactopyranoside). On the other hand, where the bacteria contained plasmid 
vector pUC19 without a DNA insert, blue colonies were formed. Through this 
procedure, colonies containing plasmid vector pUC19 with a partial genomic 
DNA insert of Alcaligenes latus, were selected. In order to determine whether 
these colonies were able to produce PHA, they each were inoculated in a broth 
capable of accumulating PHA. 

In result, recombinant E. coli which was able to accumulate PHA, was 
obtained. From the recombinant E. coli, the recombinant plasmid vector was 
separated. An analysis data showed that the recombinant plasmid vector 
pUC19 anchored a partial genomic DNA of Alcaligenes latus, 6.4 kb long and 
that this DNA fragment contained the PHA synthesis-related genes. In 
addition, base sequencing analysis revealed that the 6.4 kb DNA fragment 
coded for all of the PHA biosynthesis-related enzymes, that is, P-ketothiolase, 
acetoacetyl-CoA reductase and PHA synthase. 

In the present invention, the recombinant expression vector was named 
pJCl. The transformant which harbored plasmid pJCl was deposited in 
Korean Collection for Type Cultures, Korean Research Institute of Bioscience 
and Biotechnology on Nov. 5, 1997, with a deposition No. KCTC 0398 BP. 

li 
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EXAMPLE III : Structure Analysis of PHA Genes Derived from A. latus 

The 6.4 kb DNA insert ligated to the plasmid vector pUC19 was 
analyzed to contain all the genes for B-ketothioIase, acetoacetyl-CoA reductase 
and PHA synthase. These genes were positioned in the order of PHA synthase, 
fi-ketothiolase and acetoacetyl-CoA reductase from the 5' end to the V end. 

Regarding the sizes of the PHA biosynthesis genes, the PHA synthase 
gene, /? -ketothiolase gene and acetoacetyl-CoA reductase gene were 1608 bp 
(536 aa), 1 176 bp (392 aa) and 735 bp (245 aa) long, respectively. 

EXAMPLE IV : PHA-Producing Recombinant E. coli Containing PHA 
Biosynthesis-Related Genes Derived from A. latus 

The recombinant expression vector pJCl anchoring the 6.4 kb genomic 
DNA fragment of Ale aligenes latus was used to transform E. coli XL 1 -Blue. 
Since the bacteria which took up the recombinant expression vector could grow 
in a medium containing ampicillin, selection of the E. coli transformants was 
made on a solid medium containing 100 g/ml ampicillin. The selected E. coli 
was cultured in a defined or complex liquid medium containing 20 g/1 glucose 
to produce PHA. When the strain was cultured at a temperature of 30 or 37 "C 
in a flask, PHA was accumulated until it represented as much as 40 % or more 
of the dry cell weight. 

As described hereinbefore, the PHA biosynthesis-related genes of the 
present invention are derived from Alcaligenes latus and contains all of the 
genes for PHA synthase, B-ketothiolase and acetoacetyl-CoA reductase. When 
E. coli is transformed with the PHA biosynthesis-related genes of the present 
invention, a one-step culture of the transformant E. coli can mass-produce 

12 
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PHA. In addition, these enzymes and the genes are very helpful in 
understanding the biosynthesis of PHA in a molecular biological level. 

The present invention has been described in an illustrative manner, and 
it is to be understood the terminology used is intended to be in the nature of 
5 description rather than of limitation. 

Many modifications and variations of the present invention are possible 
in light of the above teachings. Therefore, it is to be understood that within the 
scope of the appended claims, the invention may be practiced otherwise than as 
specifically described. 

10 

EXAMPLE V : Separation of PHB gene from Alcaligenes latus and 
determination of its DNA and amino acid sequence 

In order to separate PHB gene, total DNA extracted from culture of 
15 Alcaligenes latus and digested with restriction enzymes such as BamHl, 
Hindlll, Smal, Xhol, and Sail and the DNA fragment was obtained. 

Among the resulting DNA fragments digested with BamHl, the 3.2 kb 
DNA showing positive signal , was separated by using 1 kb PHB gene derived 
from Alcaligenes eutrophus as a probe. 
20 Then the separated DNA was ligated to the BamHl restriction site of the 

vector pSK(+), whereby recombinant plasmid pAL32 was constructed, (see 
Fig- 5) 

As the result of analyzing the pAL32 DNA sequence by Sanger Method 
25 (dideoxy-nucleotide chain termination method), it has revealed that the PHB 
gene derived from Alcaligenes latus consists of 1,608 bp. The amino acid 
sequence of the PHB synthase encoded by the above PHB gene, was'analyzed 
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by using PC/Gene software program. PHB synthase derived from Alcaligenes 
latus has the amino acid sequence composed by 536 amino acids. 

EXAMPLE VI : Construction of recombinant expression vector 
5 pKTC32 containing PHB gene 

PHB gene is obtained by digesting pAL32 with EcoRl and Noil, and 
then the resulting DNA fragment was ligated to the restriction site by EcoRl 
and Notl. (see Fig. 5) 

10 

EXAMPLE VII : Preparation of PHB-producing recombinant 

Alcaligenes eutrophus LAR5 

The recombinant expression vector pKTC32 of Example VI was 
15 introduced into the strains of A. eutrophus DSM541 which is lacking in PHB 
gene. When culturing the transformant, PHB particles in the cell were 
observed. 

EXAMPLE VIII : Identification of primer region of PHB gene derived 
20 from ,4. latus 

For the purpose of identifying the PHB primer region, the total DNA of 
Alcaligenes latus was separated. The site wherefrom RNA transcription starts 
was determined by primer extension method and then the promoter region 
25 consisting of 210 bp DNA upstream was obtained. The gene sequence of 
promoter region of PHB was analyzed by PC/Gene software program . 
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INTERNATIONAL FORM 

RECEIPT IN THE CASE OF. AN ORIGINAL DEPOSIT 

issued pursuant lo Rule 7.1 



TO: Lee. Sang Yup 

Expo ApL. 212-702. Chunmln-dong. Yusong-ku. Tacjon 305-390. 
Republic of Korea 



I. IDENTIFICATION OF THE MICROORGANISM 



Idcuiificaiion reference given by Uic 


Accession number given by the 


DEPOSITOR: 


INTERNATIONAL DEPOSITARY 




. AUTHORITY: 


Escherichia coll XLl-Bluc/p JC 1 






KCTC 0398BP 



II. SCIENTIFIC DESCRIPTION AND/OR PROPOSED TAXONOMIC DESIGNATION 



The- microorganism identified under I above was accompanied by: 

[ x ] a scientific description 

[ ] a proposed taxonomic designation 

(Mark with a cross where applicable) 



III. RECEIPI" AND ACCEPfANCE 



This International Depositary Authority accepts the microorganism identified under I above, 
which was received by it on November 5 1997. 

IV. RECEIPT OF REQUEST FOR CONVERS ION 



The microorganism identified under I above was received by this International Depositary 
Authority on and a request to convert the original deposit to a deposit 

under the Budapest Treaty was received by it on 



V. INTERNATIONAL DEl'OSITARY AUTHORITY 



Name: Korea Research Institute of 


Signature(s) of person(s) having the power to 


Bioscience and Biotechnology 


represent the International Depositary 


Korean Collection forType Cultures 


Authority or of authorized official(s): 


Address: KCTC. KRIBB 




#52. Oun-dorig. Yusong-ku. 




Taejon 305-333. 


Kyung Sook Bae. Curator 


Republic of Korea 


Date: November 12 1997 
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INTERNATIONAL FORM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

issued pursuant to Rule 7. 1 

TO. LEE. Yong-Hywi 

Department of Genetic Engineering College of Natural Sciences. 
Kyungpook National University. Taegu 702-701. 
Republic of Korea 



I. IDENTIFICATION OF THE MICROORGANISM 



ISlSrrrS rCfCrCnCC *' Ven ^ ^ ACl ^'° n " Umber ,he 

iJrJ^JbiiOK; INTERNATIONAL DEPOSITARY 

„ i AUTHORITY 

| KCTC OS68BP 



II. SCIENTT RC DESCRIPT IO N AND/OR PROPOSED TAXONOM1C DESIGNATION 



The mienxwganism identified under I above was accompanied by. 

[ x J a scientific description 

I I a proposed taxonomic designation 

(Mark with a cniss where applicable) 



III- RECEIPT AND ACCEPTANCE 



This International Depositary Authority accepts the microorganism identified under 1 above 
which was received by it on -January 18 1999. 



IV. RECEIPT OF REQUEST FOR CONVERSION 



The microorganism identified under I above was received by this International Depositary 
Authority on and a request to convert the original deposit to a deposit 

under the Budapest Treaty was received by it on 

V. INTERNATIONAL DEPOSITARY AUTHORITY^ 



Name; Korean Cotection for Type Cultures j Signature^} of persons) havi ng the power to 

1 represent the IntemationiU Depositary 
A uihori ry or of auth orized officials) 

Address. Korea Research Institute of ^- ./ 

Bioscience and Biotechnology ! 
(KRIBB) j 
#52. Oun-«k>ng, Yusong-ku. j 

Taejon 305-333. \ PARK Yong-Ha. Director 

Republic of Korea j D ate^ January 25 1999 
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WHAT IS CLAIMED : 

1. A polyhydroxyalkanoate biosynthesis-related DNA fragment, 
comprising a gene for polyhydroxyalkanoate synthase, a gene for p- 

5 ketothiolase and a gene for acetoacetyl-CoA reductase, which are all derived 
from Alcaligenes latus. 

2. A polyhydroxyalkanoate biosynthesis-related DNA fragment as set 
forth in claim 1, wherein said fragment contain the gene for 

10 polyhydroxyalkanoate synthase, the gene for (3-ketothiolase and the gene for 
acetoacetyl-CoA reductase in due order and has a base sequence of Sequence 1 . 

3. A polyhydroxyalkanoate biosynthesis-related DNA fragment as set 
forth in claim 1 or 2, wherein the gene for polyhydroxyalkanoate synthase has a 

15 base sequence of Sequence 2. 

4. A polyhydroxyalkanoate biosynthesis-related DNA fragment as set 
forth in claim 1 or 2, wherein the gene for p-ketothiolase has a base sequence 
of Sequence 3. 

20 

5. A polyhydroxyalkanoate biosynthesis-related DNA fragment as set 
forth in claim 1 or 2, wherein the gene for acetoacetyl-CoA reductase has a 
base sequence of Sequence 4. 

25 6. A polyhydroxyalkanoate synthase, having an amino acid sequence of 

Sequence 5, derived from Alcaligenes latus. 

17 
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7. A p-ketothiolase, having an amino acid sequence of Sequence 6, 
derived from Alcaligenes latus. 

8. An acetoacetyl-CoA reductase, having an amino acid sequence of 
5 Sequence 7, derived from Alcaligenes latus. 

9. A recombinant expression vector pJCl, containing the 
polyhydroxyalkanoate biosynthesis-related gene of claim 1 . 

10 10. A recombinant expression vector pAL32, containing the gene for 

polyhydroxyalkanoate synthase of claim 3. 

1 1. . A recombinant expression vector pKTC32, containing the gene for 
polyhydroxyalkanoate synthase of claim 3. 

15 

12. An E. coli transformant XLl-Blue/pJCl with a deposition No. of 
KCTC 0398 BP, which is transformed with the recombinant expression vector 
of claim 9. 

20 13. An Alcaligenes eutrophus transformant LAR5 (DSM541/pKTC32) 

with a deposition No. KCTC 0568 BP, which is transformed with the 
recombinant expression vector of claim 1 1 . 

14. A method for preparing polyhydroxyalkanoate biosynthesis-related 
25 enzymes, by culturing the E. coli transformant of claim 12. 

15. A method for preparing polyhydroxybutyrate synthase, by culturing 
A. eutrophus transformant of claim 13. 

18 
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16. A method for producing polyhydroxyalkanoate and its copolymers, 
by culturing the transformant of claim 12. 

17. A method for producing polyhydroxyalkanoate and its copolymers, 
5 by culturing the transformant of 13. 
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FIG. 3a 



10 20 30 40 50 60 

GGATCCTGCT GCGCTCGGAC AAAAGCATGG GCCGAGTTTA GCGCGCGCCC TCGGACGCCC 
70 80 90 100 110 120 

CCGGCAGCGT GCAGGGTTCA CGCCATGTTC AAAAGCGCTG TGAGGCAGGT ATGCTGCACT 

•30 140 150 160 170 180 

GCGTCAATCC CGCAGTTCCG CAGTCATCCC AGAAATGCAG CTGTACAACT ACTTTCGCTC 
190 200 210 220 230 240 

CTCGGCGTCC TACCGCGTCC GCATCGCACT GGCCCTGAAG GGTCTGGCCT ACG AATACAA 
250 260 270 280 290 300 

GCCGNTGCAC CTGCAGAAGA AGGAGCAGTT CGCGGANTCG TATGCGGCCG TGTCGGCCTC 
310 320 330 340 350 360 

GCGCCTGGTG CCGCTGCTGC GCGACGGCGA CGCGTCGCTG ACGCAGTCGA TGGCCATCAT 
370 380 390 400 410 420 

CGAGTACCTG GACGAGACCC ATCCGCAGCC GCCGCTGCTG CCCTCGGACC CGCTGGGCCG 
430 440 450 460 470 480 

CGCCCGCGTG CGTGCGCTGG CGCAGGACAT CGCCTGCGAG ATCCACCCGC TCAACAACCT 
490 500 510 520 530 540 

GCGCGTGCTG CGCTACCTGG CGCACGACCT CAAGGTCGGC GAGGACGACA AGAACCGCTG 
550 560 570 580 590 600 

GTACCGCCAC TGGGTCGAGA CCGG CCTGG A GGTGGTGG AG CGCCAGCTGG CGGATCACCC 

610 620 630 640 650 660 

GTCCACCGGC CGCTTCTGCC ATGGCGACAC GCCCGGCCTG GCCGATTGCG TGCTGGTGCC 
670 680 690 700 710 720 

GCAGATCTTC AACGCCCAGC GTTTCAACTG CCGGCTGGAG CACGTGCCCA CCGTGATGCG 
730 740 750 760 770 780 

CGTGTACGAG GCCTGCATGC AGCTCGACGC CTTCGACAAG ACGCAGCCCT CCGCCTGTCC 
790 800 810 820 830 840 

CGATGCCGAG TAAGGCTCTG CAGGGCGTGC TGAGGCCCGA GTGGCCGGCA CCGGCCGGCG 
850 860 870 880 890 900 

TGGGCGCATT CATGAGCACG CGCGAGGGCG GCGTCAGCGC CGCGCCCTGG GACGGCGCCA 
910 920 930 940 950 960 

ACCTGGGCGA CGCCGTGGGC GACAGCCCGC AGGCTGTGGA CACCAACCGC GCCCGATTCG 
970 980 990 1000 1010 1020 

CCGCCGCCGC CGAGGGCGGC ACGCCGGTGT GGCTGCGCCA GGTCCACGGC ACGCGGGTGC 
1030 1040 1050 1060 1070 1080 

TGCGATTGCG CGCCGGCGAG GCCTTGCCGG CGCAGCCGCC CGAGGCCGAT GCCGTGGTCA 
1090 1100 1110 1120 1130 1140 

CCGCCGACCC CGGCCTGGTG TGCGTGGTGC AGGTGGCGGA CTGCCTGCCC GTGTTCTTCG 
1150 1160 1170 1180 1190 1200 

CAGCGTCCAA CGGCCGTGCC GTCGGCGCTG CGCATGCGGG CTGGCGCGGC CTGGCCGGTG 
1210 1220 1230 1240 1250 1260 

GCGTGCTCGA AAACACGCTG GCCGAGGTGT GCGCGCTGGC GCGCTGCGAG CCCTCCGATG 
1270 1280 1290 1300 1310 1320 

TGCTGGCCTG GATGGGGCCC TGCATCGGGC CGGAGAGTTT CGAGGTGGGG CGCGACGTGC 
1330 1340 1350 1360 1370 1380 

TGGAGGGTTT CGGCGTGGAT CCGGACGGTC CGGCCGACCC GGCCTTCGCC TGGCGTCCGC 
1390 1400 1410 1420 1430 1440 

GTGCCGACGG CAGCGCGCGC TGGCTGGCGG ACCTGCCGGG GCTGGCGCGG CGCCGGCTCG 
1450 1460 1470 1480 1490 1500 

AATTGGCAGG TCTGCGTCAG ATCAGTGGCG GACAGTGGTG CACGGTGCAG GATCGTTCAC 
1510 1520 1530 1540 1550 1560 

GGTTCTTCTC GTTCCGGCGG GACCGGGTCA CGGGGCGGCA GGCTGCCGCC GTCTGGCTGC 
I s70 1580 1590 1600 1610 1620 

GCGGATGAAG CGGTGTCCTC GGCGCGCTTG CGCGCCCGTC GCCGCGCCGG CGTCCCCAGG 
1630 1640 1650 1660 1670 1680 
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FIG. 3b 



AAGTACAGGA CGATGGACAA GGGCAGTACG CCATACAGCA GCAGCGTGAA CACCGCGCCG 

1690 1700 1710 1720 1730 1740 

AGCAAGGTGC CGTTGGGCGC CATGGCTTCG GCCACGGCCA TCATCAGCAC CACGTACAGC 

1750 1760 1770 1780 1790 1800 

CATGCCAGAG CAACCAAGTA CATAGCAAAA ACCCGCAATT ACGCAGAATG ACGTATTTCG 

1810 1820 1830 1840 1850 I860 

TACAATGAAA ACTGTTGTCA TGATGCGGTA AGACACGAAG CCTACAACGC GATCCAGCAA 

1870 1880 1890 1900 1910 1920 

CGGTTTTCGT GAAAAAGTCC TCAGGAGACG AGCGTGACAC TGCATCCCAT TCCCGCACTG 

1930 1940 1950 1960 1970 1980 

CAACAGCTTG GCGACAACGC CACGGCGCTG AGTGCCGCCA TCTCGGAAGC GCTGCGCGCG 

1989 1998 2007 2016 2025 2034 

ATG TCG GGC CTG AAC CTG CCG ATG CAG GCC ATG ACC AAG CTG CAG GGC GAG TAC 

MSGLN LP MQAMTKLQG EY 
pliaC A , 

2043 2052 2061 2070 2079 2088 

CTC AAC GAG GCG ACG GCG CTG TGG AAC CAG ACG CTG GGC CGC CTG CAG CCC GAC 
LNEATA LWN QTLG RL Q P D 

2097 2106 2115 2124 2133 2142 

GGC AGC GCC CAA CCG GCC AAG CTG GGC GAC CGG CGC TTC TCG GCC GAG GAC TGG 
GS AQPA KLG DRRFSAE DW 

2151 2160 2169 2178 2187 2196 

GCC AAG AAC CCC GCC GCG GCC TAC CTG GCG CAG GTC TAC CTG CTC AAT GCC CGC 
A KNPAA AYL AQVYLL NAR 

2205 2214 2223 2232 2241 2250 

ACG CTG ATG CAG ATG GCC GAG TCC ATC GAG GGC GAC GCC AAG GCC AAG GCG CGC 
T LMQ MAE S I E G D AKA-KAR 

2259 2268 2277 2286 2295 • 2304 

GTG CGC TTC GCC GTG CAG CAG TGG ATC GAC GCC GCG GCG CCG AGC AAC TTC CTG 
V-RFAVQ QWI DA AAP SNFL 

2313 2322 2331 2340 2349 2358 

GCG CTC AAT CCC GAG GCG CAG CGC AAG GCG CTG GAG ACC AAG GGG GAG^GC ATC 
ALNPEAQRKALE TKGE SI 

2367 2376 2385 2394 2403 2412 

AGC CAG GGC CTG CAG CAG CTG TGG CAT GAC ATC CAG CAG GGC CAC GTG TCG CAG 
SQGLQ'QLWHDI QQGUVSQ 

2421 2430 2439 2448 2457 2466 

ACG GAC GAG AGC GTG TTC GAG GTG GGC AAG AAC GTC GCC ACC ACC GAG GGC GCG 
TDE SVFE VGKNVATTEGA 

2475 2484 2493 2502 2511 2520 - 

GTC GTG TAC GAG AAC GAC CTG TTC CAG CTC ATC GAG TAC AAG CCG CTG ACG CCC 
VVYEN D LFQLIEY KP LTP 

2529 2538 2547 2556 256S 2574 

AAG GTG CAC GAG AAG CCG ATG CTG TTC GTG CCG CCG TGC ATC AAC AAG TAC TAC 
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FIG. 3c 



KVHEK P MLF V PPCINKYY 

2583 2592 2601 2610 2619 2628 

ATC CTG GAC CTG CAG CCG GAC AAC AGC CTC ATC CGC TAC ACC GTC GCC CAG GGC 
'LDLQP DN SLIR Y TV A Q G 

2637 2646 2655 2664 2673 2682 

CAC CGG GTG TTC GTG GTG AGC TGG CGC AAC CCC GAC GCC TCC GTC GCC GGC AAG 
H R V F V VS WR N PDA SV AGK 

2691 2700 2709 2718 2727 ?736 

ACC TGG GAC GAC TAC GTG GAG CAG GGC GTG ATC CGC GCC ATC CGC GTG ATG CAG 
TWDDYVEQGVIRAI RVMQ 

2745 2754 2763 2772 2781 2790 

CAG ATC ACG GGG CAC GAG AAG GTC AAC GCG CTG GGC TTC TGC GTC GGC GGC ACC 
QITGHE KVNALGFCVGGT 

2799 2808 2817 2826 2835 2844 

ATC CTG AGC ACG GCG CTG GCG GTG CTG GCC GCG CGC GGC GAG CAG CCC GCG GCG 
•LSTALAVLAARGE QP AA 

2853 2862 2871 2880 2889 2898 

AGC CTG ACG CTG CTG ACC ACG CTG CTG GAC TTC AGC AAC ACC GGC GTG CTG GAC 
SL TLLTTLLDFS NTGVLD 

2907 2916 2925 2934 2943 2952 

CTG TTC ATC GAC GAG GCC GGC GTG CGC CTG CGC GAG ATG ACC ATC GGC GAG AAG 
L F I D E A G VRLREMTIGEK 

2961 2970 2979 2988 2997 3006 

GCG CCC AAC GGC CCG GGC CTG CTC AAC GGC AAG GAG CTG GCC ACC ACC TTC AGC 
A PNG P G L LNG KE L ATTFS 

3015 3024 3033 3042 3051 3060 

TTC CTG CGC CCG AAC GAC CTG GTC TGG AAC TAC GTG GTG GGC AAC TAC CTC AAG 
F L R P ND L VWNYV V GNYLK 

3069 3078 3087 3096 3105 3114 

GGC GAG GCG CCG CCG CCC TTC GAC CTG CTG TAC TGG AAC TCC GAC AGC ACC AAC 
GEA P P P F DLLY WNS D STN 

3123 3132 3141 3150 3159 3168 

ATG GCC GGG CCC ATG TTC TGC TGG TAC CTG CGC AAC ACC TAC CTG GAG AAC AAG 
■MAG PMFCWYL RN TYLE NK 

3177 3186 3195 3204 3213 3222 

TTG CGC GTT CCC GGT GCC CTG ACC ATC TGC GGC GAG AAG GTG GAC CTC TCG CGC 
LRVPGAL TICGEKVDLSR 

3231 3240 3249 3258 3267 3276 

ATC GAG GCG CCG GTG TAC TTC TAC GGT TCG CGC GAG GAC CAC ATC GTG CCC TGG 
IEA P V YFYGSREDHIVPW 

3285 3294 3303 3312 3321 3330 
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FIG. 3d 



G AA TCG GCC TAC GCC GGC ACG CAG ATG CTG AGC GGC CCC AAG CGC TAT GTC CTG 
ES A YAG T QML SG P KR Y V L 

3339 3348 3357 3366 3375 3384 

GGT GCG TCT GGC CAC ATC GCC GGC GTG ATC AAC CCC CCG CAG AAG AAG AAG CGC 
GASGHIAGVIN PPQKKKR 

3393 3402 3411 3420 3429 3438 

AGC TAC TGG ACC AAC GAG CAG CTC GAC GGC GAC TTC AAC CAG TGG CTG GAA GGC 
SYWTNEQ L DGDFNQW LEG 

3447 3456 3465 3474 3483 3492 

TCC ACC GAG CAT CCT GGC AGC TGG TGG ACC GAC TGG AGC GAC TGG CTC AAG CAG 
S T E H P G SWWT DWS DWLKQ 

3501 3510 3519 3528 3537 3546 

CAC GCG GGC AAG GAA ATC GCC GCA CCC AAG ACT CCC GGC AAC AAG ACC CAC AAG 
HAGKEIAAPKTPGNKTHK 

3555 3564 3573 3582 

CCC ATC GAG CCC GCC CCC GGG CGT TAC GTG AAG CAG AAG GCC 
PIEPAPG RYVKQKA 

3600 3610 3620 3630 3640 

TG AGCCGCGGCC CCTGAGCCTT CTTTAACCCG ACCTTGACAA ACGAGGAGAT AAGC 



3653 3662 3671 3680 3689 3698 

ATG ACC GAC ATC GTC ATC GTC GCC GCA GCC CGC ACC GCC GTG GGC AAG TTC GGC 

MT D I V I V AA A R TAV G KFG 
phaA M -+ 

3707 3716 3725 ■ 3734 3743 3752 

GGC ACG CTG GCC AAG ACC CCC GCT CCG GAG CTG GGC GCC GTG GTC ATC AAG GCC 
GTLAK.TPAPEL GAVV I K A 

3761 3770 3779 3788 3797 3806 

CTG CTG GAG AAG ACG GGC GTC AAG CCC GAC CAG ATC GGT GAA GTC ATC ATG GGC 
LLEKTGVKPDQIGEV IMG 

3815 3824 3833 3842 3851 3860 

CAG GTG CTG GCC GCC GGC GCG GGC CAG AAC CCC GCG CGC CAG GCG ATG ATG AAG 
QVLAAG AG QNPA RQA MMK 

3869 3878 3887 3896 3905 3914 

• GCG GGC ATC GCC AAG GAA ACG CCG GCG CTG ACC ATC AAC GCC GTG TGC GGG TCC 
AGIAKETPALTI NAVCGS 

3923 3932 3941 3950 3959 3968 

GGC CTC AAG GCC GTG ATG CTG GCC GCC CAG GCC ATC GCC TGG GGC GAC AGC GAC 
G L K A V M L A A Q AI AW GD SD 

3977 3986 3995 4004 4013 4022 

ATC GTC ATC GCC GGC GGC CAG GAG AAC ATG AGC GCC AGC CCG CAC GTG CTG ATG 
IVIAGGQENMSA SPHVLM 
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FIG. 3e 



4031 4040 4049 4058 4067 4076 

GGC AGC CGC GAC GGC CAG CGC ATG GGC GAC TGG AAG ATG GTC GAC ACC ATG ATC 
G SRDGQRM GDWKMVDTMI 

4085 4094 4103 4112 4121 4130 

AAC GAC GGC CTG TGG GAC GTG TAC AAC AAG TAC CAC ATG GGC ATC ACG GCC GAG 
N DGLWDVY NKYHMG I TAE 

4139 4148 4157 4166 4175 4184 

AAC GTC GCC AAG GAA CAC GAC ATC AGC CGC GAC CAG CAG GAC GCC CTG GCC CTG 
NVAKEH D 1 SRDQQ DALAL 

4193. 4202 4211 4220 4229 4238 

GCC AGC CAG CAG AAG GCC ACC GCC GCG CAG GAA GCC GGC CGC TTC AAG GAC GAG 
A SQQ K A TA AQE A GRF K D E 

4247 4256 4265 4274 4283 4292 

ATC GTT CCG GTC TCG ATC CCG CAG CGC AAG GGC GAC CCG GTG CTG TTC GAC ACC 
I VPVS I P QRKG DP V LFDT 

4301 4310 4319 4328 4337 4346 

GAC GAG TTC ATC AAC AAG AAG ACC ACC GCC GAA GCG CTG GCG GGC CTG CGC CCG 
DE FIN KKTTAE ALA GLRP 

4355 4364 4373 4382 4391 4400 

GCC TTC GAC AAG GCC GGC AGC GTG ACC GCG GGC AAC GCC TCG GGC ATC AAC GAC 
A FDKAG SV TAGN ASG I ND 

4409 4418 4427 4436 4445 4454 

GGC GCC GCT GCG GTG ATG GTG ATG TCC GCC GCC AAG GCG AAG GAG CTG GGC CTG 
G A AAV MV MSAA K A KE.L G L 

4463 4472 4481 4490 4499 4508 

ACG CCC ATG GCG CGC ATC AAG AGC TTC GGC ACC AGC GGC CTG GAT CCG GCC AAG 
T-PMAR I KS FG T S G LD P A K 

4517 4526 4535 4544 4553 4562 

GTC AAC GTC AAC GGC GGT GCC ATC GCC ATC GGC CAC CCC ATC GGC GCC TCC GGC 
VNVNGG AI AIGHP I Ga'sg 

4571 4580 4589 4598 4607 4616 

TGC CGC GTG CTG GTG ACG CTG CTG CAC GAG ATG CAG CGC CGG GAC GCC AAG AAG 
CRVLVTLLHE MQRRDAK.K 

4625 4634 4643 4652 4661 4670 

GGC CTG GCC GCG CTG TGC ATC GGC GGC GGC ATG GGC GTG TCG CTG ACC GTC GAG 
GLAALC 1 GGGMGV SLTVE 



CGC 
R 

4680 4690 4700 4710 4720 4730 

TGATCAG AAGAACCGGG CGGCCCCGCG CCGCCCGCCC GGCGTTCCAC GCGGGTGCGC 
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FIG. 3f 



4740 4 750 4760 4 770 4 7 80 4790 

CGGGATACCA GACGAACCAA ACCACCAAGG GCTTCGAGAC GGCCCGAAGA AGGAGAGACA 

G 

4800 4809 4818 4827 4836 4845 

ATG GCA CAG AAA CTG GCT TAC GTG ACC GGC GGC ATG GGC GGC ATC GGC ACC TCG 

MAQKL AY V TG GMGG I G TS 
pliaB A , -+ 

4854 4863 1872 4881 4S90 4899 

ATG TGC CAG CGC CTG CAC AAG GAC GGC TTC AAG GTG ATC GCC GGC TGC GGT CCG 
MCQRL H K D GFK V I a GC GP 

4908 4 917 4926 4935 4944 4953 

AGC CGC GAC CAC CAG AAG TGG ATC GAT GAA CAG GCC GCG CTG GGC TAT ACC TTC 
S RDH QK W I DE Q A AL GYTF 

4962 4971 4980 4989 4998 5007 

TAC GCC TCC GTG GGC AAC GTG GCC GAC TGG GAC TCC ACC GTG GCC GCC TTC GAG 
Y ASV GNV ADW DSTVAAFE 

5016 5025 5034 5043 5052 5061 

AAG GTC AAG GCC GAG CAC GGC ACC GTG GAC GTG CTG GTG AAC AAC GCC GGC ATC 
KVKAE RG TVDV L VNN AG I 

5070 5079 5088 5097 5106 5115 

ACG CGT GAC GGG CAG TTC CGC AAG ATG AGC AAG GCC GAT TGG CAG GCC GTG ATG 
T RDGQFRKMS KADWQAVM 

5124 5133 5142 5151 5160 5169 

TCG ACC AAC CTC GAC AGC ATG TTC AAC GTC ACC AAG CAG GTG ATC GAG GGC ATG 
STNLD.SMFNVTKQVIE GM 

5178 5187 5196 5205 5214 5223 

CTG GAC AAG GGC TGG GGC CGG ATC ATC AAC ATC TCC TCG GTC AAC GGC GAG AAG 
LDKGWGRII N1SSVNGEK 

5232 5241 5250 5259 5268 5277 

GGC CAG TTC GGC CAG ACC AAC TAC TCC GCC GCC AAG GCC GGC ATG CAC GGC TTC 
GQFG Q TNY S A A K AG MHGF 

5286 5295 5304 5313 5322 5331 

TCC ATG GCG CTG GCG CAG GAA GTG GCG GCC AAG GGC GTG ACG GTG AAC ACC GTG 
SMALAQE VAAKGVTV NTV 

5340 5349 5358 5367 5376 5385 

AGC CCG GGC TAC ATC GCC ACG GAC ATG GTC AAG GCC ATC CGC CAG GAC GTG CTG 
SPGYIATDMVKAIRQDVL 

5394 5403 5412 5421 5430 5439 

GAC AAG ATC ATC GCC ACC ATT CCC ATC CGT CGC CTG GGT ACG CCG GAG GAG ATC 
DK11A TIPIRRLGTPEE I 

5448 5457 5466 5475 5484 5493 
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FIG. 3g 



GCC TCC ATC TTC CCC TGG CTG GCC GGC GAA GAA TCG GGC TTC ACC ACC GGT GCC 

fpwlagees gfttga 

5502 5511 5520 

GAC TTC AGC TGC AAC GGC GGC CTG CAC ATG GGC 
D F -S CN GGL HMG 

5530 5540 5550 5560 5570 5580 

TGAG GCCCGCGGCT CCATGCCCAC CTGCGTGGGC ATGGACGGGC CGAAGGACCG 

3390 5600 5610 5620 5630 sfian 

AGCTCTGCGA GGGTGCGGCC TGCAAGGCTQ AGGCCTGCTG CGCCGCGTGC CCGCGAGGGC 

56^0 5660 5670 5680 ' 5690 5700 

ACGTOCCOAA CCACCAAAAO GCCGCGCATT GCGCGGCCTT TTCCTTTCTG GATCGGTGCG 
5710 5720 57 30 5740 5750 -;7fin 

GACGGGTGCC GCGTCAGGCA GGGCAGGCCC CCGGCCTTCA CTCCACCATG CCGCACATGA 
5770 57X0 5? 90 5800 5810 sr™ 

AGTACTTGAT CACCCTTTGG CCGCGAAGCC CAGCATGCCG AAGCCCAGCG CCAGGAACAG 
58j0 5840 5850 5860 5870 5RR0 

CACGAAGGTG CCGAACTTGC CGGCCTTCGA CTCGCGCGCG AGCTGAAAGA TGATGAATGC 
5890 5900 5910 5920 5930 5940 

CATGTAGAGC ATGAAGGCCG TGACGCCGAC GGTCAGGCCC AGCTGGGCAA TGTTTTCCTC 

5950 5960 5970 5980 5990 6000 

GTTGATTTCG AACATCGTTT GTTGTCTCAG GCTGCTGCCA CGCGGCTGAC GTGCTCGCCG 
6010 6020 6030 6040 6050 6060 

CGCGGCCGGG CCCCAACTGC CCGCAGCGGT TCTCGATCAG GTTCTCAAGG CATCTCGTGC 
6070 6080 6090 6100 6110 6 pn 

CACTGGGAGG TGTCCACCAG GTCGCGGTAG GCGTGCCAGC TCGAATGCGC CAGCCACGGC 
6U0 6,40 6150 6160 6170 6180 

ACTACCACGA TCAGGCCCAG CAGCAGCGTG GCCATGCCCA GCAGCGTCAG CGCCATGATC 
6190 6200 6210 6220 6230 6740 

AGCGCCGCCC ACAGCGCCAG CGGCAGTGGG TGCTGCATCA CCACGCGCCA GCTCGTGAGC 

6250 6260 6270 6280 

ACCGCCACCA GCACGCCCAC GTGGCGGTCC AGCAGCATCG GGATCC 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT : LG CHEMICAL LTD. 
5 LEE, Sang Yup 

CHOI, Jong-il 
CHOO, Seung-Ho 
YOON, Hye-Sung 
HAN, Kyuboem 
io SONG, Ji-Yong 

LEE, Yong-Hyun 
HUH, Tae-Lin 
HONG, Sung-Kook 

(ii) TITLE OF INVENTION : POLYHYDROXYALKANOATE 
15 BIOSYNTHESIS-RELATED GENES DERIVED 

FROM Alcaligenes latus 

(iii) NUMBER OF SEQUENCES : 8 

(2) INFORMATION FOR SEQ ID NO. : 1: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6436 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

25 (ii) MOLECULAR TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 1 : 
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GGATCCTGCT 


GCGCTCGGAC 


AAAAGCATGG 


GCCGAGTTTA 


GCGCGCGCCC 


TPPPAPPPPP 

1 bbbl/ibbbbb 






CCGGCAGCGT 


GCAGGGTTCA 


CGCCATGTTC 


AAAAGCGCTG 


TGAGGCAGGT 


ATP/TPPAPT 

f\ 1 Ub I Ubrtb 1 






GCGTCAATCC 


CGCAGTTCCG 


CAGTCATCCC 


AGAAATGCAG 


ptp,tapaapt 

b 1 u 1 Awi/ib 1 


APTTTPPPTP 
nbl 1 Ibbblb 






CTCGGCGTCC 


TACCGCGTCC 


GCATCGCACT 


p,PtPpptp,aap, 

UVJbbb 1 UMU 


pptptppppt 

uu J. b 1 bbbb 1 


APPA ATAPA A 
AbbrtAlAbA-\ 


240 


5 


GCCGGTGCAC 


CTGCAGAAGA 


AGGAGCAGTT 


rcrc/iACTrr, 

bVjbuurlU 1 Lu 


tatppppppp 

Irtlbbbbbbb 


TPTPPPPPTP 
lblbbbbblb 


300 




GCGCCTGGTG 


CCGCTGCTGC 


GCGACGGCGA 


pppptppptp 

bubblbbblb 


apppaptppa 

AbbbAb 1 bbA 


TPPPP ATP AT 

IbbbbAlbAl 


360 




CGAGTACCTG 


GACGAGACCC 


ATCCGCAGCC 


bbbbblbblb 


rmrnr app 
bbblbbbAbb 


bbblbbbbbb 


420 




CGCCCGCGTG 


bU 1 UVUU 1 VJU 


pppappapat 


bbbblbbbAb 


atpp a rrnrr 
AlbbAbbbbb 


TCAACAACCT 


480 




GCGCGTGCTG 


PPtPTAPPTPP 
tub 1 nbb 1 vjrtj 


pppappappt 

bbbnbbttbb 1 


bAAbblbbbb 


GAGGACGACA 


AGAACCGCTG 


540 




GTACCGCCAC 


tpPtPtppapa 


pppppptpp a 


PPTPPTPPAP 

bblbblbbAb 


CGCCAGCTGG 


CGGATCACCC 


600 




GTCCACCGGC 


bubl IbluvL 


atppppapap 

n 1 u\jbb/\bnb 


bbbbbbbb 1 b 


GCCGATTGCG 


TGCTGGTGCC 


660 




GCAGATCTTC 


aapppppapp 

iinbbbbbnbb 


PTTTPA APTP 
bill bAAb 1 b 


bbbbblbbAb 


CACGTGCCCA 


CCGTGATGCG 


720 




CGTGTACGAG 


p/ttppatp,p 

Ubb lubttlbb 


apptpp appp 

nbb 1 bbnbbb 


PTTPP AP A A P 

bl IbbAbAAb 


ACGCAGCCCT 


CCGCCTGTCC 


780 




CGATGCCGAG 


TAAPP.PTPTP 

J /i/iUUb I b 1 b 


bnbbbbb 1 bb 


TP APPPPPP A 

IbAbbbbbbA 


blbbbbbbbA 


CCGGCCGGCG 


840 




TGGGCGCATT 


patPtAP,papp 

brt 1 VjnVjrbnbu 


ppppappppp 

bubUrtbbbbb 


bbblbAbbbb 


bbbbbbblbb 


GACGGCGCCA 


900 




ACCTGGGCGA 


pp/tp/tcp/ip 

bUbbU 1 Uuub 


pap apppppp 

unbrtubbbbb 


A PPPTPTPP A 

AbbblblbbA 


pappa 

bAbbAAbbbb 


GCCCGATTCG 


960 




CCGCCGCCGC 


bunuuvjbuvjb 


app,pppptpt 


bbbibbbbbA 


pj^Tnr apppp 
bblbbAbbbb 


Abbbbbblbb 


1020 




TGCGATTGCG 


CGCCGGCGAG 


GCCTTGCCGG 


bUbftUbbbbb 


PPAPPPPPflT 
bbAbbbbbA 1 


bbbblbbibA 


1080 




CCGCCGACCC 


CGGCCTGGTG 


TGCGTGGTGC 


apptpp,pp,pa 

rtbru 1 UbJbbbA 


PTPPPTPppp 
b 1 bbb 1 bbbb 


PTPTTPTTPP 
blbl lbi Ibb 




20 


CAGCGTCCAA 


CGGCCGTGCC 


GTCGGCGCTG 


CGCATGCGGG 


PTPPPPPPPP 
b l bbbbbbbb 


PTPHPPPTTP 
b 1 bbbbbb 1 b 


1200 




GCGTGCTCGA 


AAACACGGTG 


GCCGAGGTGT 


GCGCGCTGGC 


PPPPTPPPAP 
Ubub i bbbnb 


crrrrrr atp 

bbb 1 bbb/i 1 b 






TGCTGGCCTG 


GATGGGGCCC 


TGCATCGGGC 


CGGAGAGTTT 


ppapx;tp/^p 


PPPP APPTPP 
bbbbnbb 1 bb 






TGGAGGGTTT 


CGGCGTGGAT 


CCGGACGGTC 


CGGCCGACCC 


PPtPPTTPP/T 

VJbJbbl IbVjbb 


TPPPPTPPPP 
1 Ubbb 1 bbbb 






GTGCCGACGG 


CAGCGCGCGC 


TGGCTGGCGG 


apptyippppp 

abb 1 Ubbuuu 


bblbbbbbbb 


bbbbbbblbb 


1440 


25 


AATTGGCAGG 


TCTGCGTCAG 


ATCAGTGGCG 


GACAGTGGTG 


CACGGTGCAG 


GATCGTTCAC 


1500 




GGTTCTTCTC 


GTTCCGGCGG 


GACCGGGTCA 


CGGGGCGGCA 


GGCTGCCGCC 


GTCTGGCTGC 


1560 




GCGGATGMG 


CGGTGTCCTC 


GGCGCGCTTG 


CGCGCCCGTC 


GCCGGGCCGG 


CGTCCCCAGG 


1620 




AAGTACAGGA 


CGATGGACAA 


GGGCAGTACG 


CCATACAGCA 


GCAGCGTGAA 


CACCGCGCCG 


1680 




AGCAAGGTGC 


CGTTGGGCGC 


CATGGCTTCG 


GCCACGGCCA 


TCATCAGCAC 


CACGTACAGC 


1740 
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CATGCCAGAG 


CAACCAAGTA 


CATAGCAAAA 


Accrcr A ATT 


APPP APA ATP 


A PPT A TTTPP 
AbbiAl 1 Ibb 






TACAATGAAA 


ACTGTTGTCA 


TGATGCGGTA 


AP.ArAPr.AAP 


PPTAPA APPP 
bb 1 nbrtnbbb 


P ATPPAPPA A 
bAlbbAbbAA 






CGGTTTTCGT 


GAAAAAGTCC 


TCAGGAGACG 


AP.PP.TP.APAP 


TPPATPPPAT 
IbbAlbbbAl 


Trrrrj^ a ptp 
IbbbbbAbib 


1920 




CAACAGCTTG 


GCGACAACGC 




APTPPPPPPA 
Ab 1 bbbbbbA 


TPTPPPA APP 

IblbbbAAbb 


bblbbbbbbb 


1980 




ATGTCGGGCC 




r,ATP,PAPnpp 


ATPAPPA APP 
AlbAbbAAbb 


TPP A PPPPPA 

ibbAbbbbbA 


PT A PPTP A AP 

blAbblbAAb 


2040 






ppptp.tpp.a a 


PPAPAPPPTP 


bbbbbbblbb 


AbbbbbAbbb 


CAGCGCCCAA 


2100 




ppppppa app 


tpppppappp 

l bbbbbAbbb 


PPPPTTPTP^ 

bbbbl Iblbb 


bbbbAbbALl 


GGGCCAAGAA 


CCCCGCCGCG 


2160 




ppptapptpp 


ppp apptpt a 
bbLAbblH A 


mrrrr a at 
bblbblbAAl 


GCCCGCACGC 


TGATGCAGAT 


GGCCGAGTCC 


2220 




atppappppp 

AlbbAbbbbb 


appppa a ppp 
AbbbbAAbbb 


bAAbbbbbbb 


GTGCGCTTCG 


CCGTGCAGCA 


GTGGATCGAC 


2280 




bbbbbbbbbb 


pp a pp a a ptt 
LbAbLAAL 1 1 


bLlbbbbblb 


AATCCCGAGG 


CGCAGCGCAA 


GGCGCTGGAG 


2340 




app a sfYYyr 
AbbAAbbbbb 


AbAbbAlLAb 


CCAGGGCCTG 


CAGCAGCTGT 


GGCATGACAT 


CCAGCAGGGC 


2400 




p a pptptppp 
LAlblblbbb 


AP APPP APP A 

AbAbbbALbA 


GAGCGTGTTC 


GAGGTGGGCA 


AGAACGTCGC 


CACCACCGAG 


2460 




uutAjbbb 1 bb 


TPTAPPAPA A 

iblAbbAbA'l 


PP A PnYTTP 

bbAbbibl lb 


CAGCTCATCG 


AGTACAAGCC 


GCTGACGCCC 


2520 




AAPPTPPAPP 


apa Acrrr at 
AbAAbbbbAl 


PPTPTTPPTP 

bblbl Ibblb 


CCGCCGTGCA 


TCAACAAGTA 


CTACATCCTG 


2580 




papptptapp 


pppapaapap 


PPTP ATPPPP 

bblbAlbbbb 


TACACCGTCG 


CCCAGGGCCA 


CCGGGTGTTC 


2640 




ptpptpappt 


PPPPPA APPP 
bbbbbAAbbb 


pp a rnrcTrr 
bbAbbbblbb 


blbbbbbbbA 


AGACCTGGGA 


CGACTACGTG 


2700 




PAPPAPPP.PP 


TPATPPPPPP 
IbAlbbbbbb 


p a TrrnrrTr 
bAlbbbbblb 


A TPP A P*P A/^ A 

AlbbAbbAbA 


TCACGGGGCA 


CGAGAAGGTC 


2760 




AACGCGCTGG 




ppappppapp 


ATPPTPAPPA 

AlbblbAbbA 


bbbbbblbbb 


GGTGCTGGCC 


2820 




GCGCGCGGCG 


appapppppp 


ppppappptp 


APPPTPPTPA 

AbbblbblbA 


pp a nnnTrnT 
bbAbbbibbl 


GGACTTCAGC 


2880 


20 


AACACCGGCG 


TGCTGGACCT 


GTTCATCGAC 


PAPPPPPPPP 


Ibbbbbibbb 


PP APA TP APP 

bbAbAlbAbb 


2940 




ATCGGCGAGA 


AGGCGCCCAA 


CGGCCCGGGC 


PTPPTPA APP 


PPA APPAPPT 

bbAAbbAbb 1 


bbbbAbbAbb 


3000 




TTCAGCTTCC 


TGCGCCCGAA 


CGACCTGGTC 


Tnr,AArTAPP 


TPPTPPPPA A 
IbblbbbbAA 


PTAPPTPA AP 

blAbblbAAb 


3060 




GGCGAGGCGC 


CGCCGCCCTT 


CGACCTGCTG 


TAPTPPA APT 


PPPAPAPPAP 
bbbAbAbbAb 


PA APATPPPP 

bAAbAlbbbb 


3120 




U\JVJVAA//1 1U1 


TPTPPTPPTA 


PPTPPPPAAP 


APPTAPPTPP 
AbblAbblbb 


AGAACAAGTT 


GCGCGTTCCC 


3180 


25 


GGTGCCCTGA 


CCATCTGCGG 


CGAGAAGGTG 


GACCTCTCGC 


GCATCGAGGC 


GCCGGTGTAC 


3240 




TTCTACGGTT 


CGCGCGAGGA 


CCACATCGTG 


CCCTGGGAAT 


CGGCCTACGC 


CGGCACGCAG 


3300 




ATGCTGAGCG 


GCCCCAAGCG 


CTATGTCCTG 


GGTGCGTCTG 


GCCACATCGC 


CGGCGTGATC 


3360 




MCCCCCCGC 


AGAAGAAGAA 


GCGCAGCTAC 


TGGACCAACG 


AGCAGCTCGA 


CGGCGACTTC 


3420 




ACCAGTGGC 


TGGAAGGCTC 


CACCGAGCAT 


CCTGGCAGCT 


GGTGGACCGA 


CTGGAGCGAC 


3480 
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TGGCTCAAGC 


AGCACGCGGG 


CAAGGAAATC 


GCCGCACCCA 


AGACTCCCGG 


CAACAAGACC 


3540 




CACAAGCCCA 


TCGAGCCCGC 


CCCCGGGCGT 


TACGTGAAGC 


AGAAGGCCTG 


AGCCGCGGCC 


3600 




CCTGAGCCTT 


CTTTAACCCG 


ACCTTGACAA 


ACGAGGAGAT 


AAGCATGACC 


GACATCGTCA 


3660 




TCGTCGCCGC 


AGCCCGCACC 


GCCGTGGGCA 


AGTTCGGCGG 


PAPPPTPPPP 


AAPAPPPPPP 




5 


CTCCGGAGCT 


GGGCGCCGTG 


GTCATCAAGG 


CCCTGCTGGA 


GAAGACGGGC 


PrTP A A crccc. 


3780 




ACCAGATCGG 


TGAAGTCATC 


ATGGGCCAGG 






PAPA APPPPP 






CGCGCCAGGC 


GATGATGAAG 


GCGGGCATCG 


ppaappaaap 




APPATPA APP 
ALLAlLAALb 


3900 




CCGTGTGCGG 


CTCCGGCCTC 


aapp,ppptpa 


tpptpppppp 

Ibblbbbbbb 


PPAPPPP ATP 

bbAbbLLAlL 


bbblbbbbLb 


3960 




ACAGCGACAT 


CGTCATCGCC 


nornnrrAfy; 


APA APATPAP 


UjlXAlrUXU 


CACGTGCTGA 


4020 


10 


TGGGCAGCCG 


CGACGGCCAG 


vAjV^n i VjVAAAj 


APTPPAAPAT 


pptpp a p a rr 
bblbbALAbb 


ATGATCAACG 


4080 




ACGGCCTGTG 


GGACGTGTAC 


AAPAAP.TAPP 


APATPPPPAT 


PAPPP/W 1 AP 

bALbbLLbAb 


MLblbbLLA 


4140 




AGGAACACGA 


CATCAGCCGC 


nArrAfXAnr. 


APPPPPTPPP 


rr^Trj^cc a pp 
LLlbbLLAbb 


CAGCAGAAGG 


4200 




CCACCGCCGC 


PtPappaappp 




APPAPPAPAT 

AbbAbbAbAl 


bbl ILLbblb 


tpp a Tnnmn 
IbbAlbbLbb 


4260 




AGCGCAAGGG 


CGACCCGGTG 


PTP/TTPP.AP A 


PPPAPPAPTT 


P ATP A AP A AP 

LAlbAALAAb 


A APAPPAPPP 

AAbALLALbb 


4320 




CCGAAGCGCT 


U\jv»AjU\j\A> 1 u 




TPPAPA APPP 


LbbLAbbblb 


a rrnmrm a 
ALLbiAibbbA 


4380 




ACGCCTCGGG 


CATCAACGAC 


GGCGCCGCTG 


ppp,tp,atpp,t 


PATPTPPPPP 


PPPA APPPPA 
bbLAAbbbbA 


4440 




AGGAGCTGGG 


CCTGACGCCC 


ATGGCGCGCA 


TPAAPAPPTT 


PPPPAPPAPP 


PPPPTPPATP 
bbbbJlbbAlb 


4500 




CGGCCACCAT 


GGGCATGGGC 


CCGGTGCCGG 


PPTPPtPP,PAA 


P/JPHPTrj^AP. 








GGCAGGTCGG 


TGACGTGGAC 


CTGTTCGAGC 


TCAACGAAGC 


PTTPPPPPPP 


papppptppp 




20 


CGGTGAACAA 


GGAGCTGGGC 


GTGGATCCGG 


CCAAGGTCAA 


CGTCAACGGC 


p,p,tPtPpatpp, 






CCATCGGCCA 


CCCCATCGGC 


GCCTCCGGCT 


GCCGCGTGCT 


GGTGACGCTG 


CTGCACGAGA 






TGCAGCGCCG 


GGACGCCAAG 


AAGGGCCTGG 


CCGCGCTGTG 


CATCGGCGGC 


GGCATGGGCG 


4800 




TGTCGCTGAC 


CGTCGAGCGC 


TGATCAGA^G 


AACCGGGCGG 


CCCCGCGCCG 


CCCGCCCGGC 


4860 




GTTCCACGCG 


GGTGCGCCGG 


GATACCAGAC 


PAAPPAAAPP 


AppA APPPPT 
AbLAAbbbb 1 


TPPAPAPPPP 
I bbAbALbbb 


4920 


25 


CCGAAGAAGG 


AGAGACAGAT 


GGCACAGAAA 


CTGGCTTACG 


TGACCGGCGG 


CATGGGCGGC 


4980 




ATCGGCACCT 


CGATGTGCCA 


GCGCCTGCAC 


AAGGACGGCT 


TCMGGTGAT 


CGCCGGCTCC 


5040 




GGTCCGAGCC 


GCGACCACCA 


GAAGTGGATC 


GATGAACAGG 


CCGCGCTGGG 


CTATACCTTC 


5100 




TACGCCTCCG 


TGGGCAACGT 


GGCCGACTGG 


GACTCCACCG 


TGGCCGCCTT 


CGAGAAGGTC 


5160 




AAGGCCGACJC 


ACGGCACCGT 


GGACGTGCTG 


GTGAACAACG 


CCGGCATCAC 


GCGTGACGGG 


5220 
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CAGTTCCGCA 


AGATGAGCAA 


GGCCGATTGG 


CAGGCCGTGA 


TPTPPAPPAA PPTPPAPAPP 






ATGTTCAACG 


TCACCAAGCA 


GGTGATCGAG 


GGCATGCTGG 


APAAPPPPTP CIT-TCCr hTC 






ATCAACATCT 


CCTCGGTCA^ 


CGGCGAGAAG 


PPPPAPTTPP 


PPPAPAPPAA PTAPTPPPPP 
U^O/HjaLLM L/ 1 AU I bbubb 






GCCAAGGCCG 


GCATGCACGG 


PTTPTPPATP 


PPPPTPPPPP 


appaaptppp nprrAticmr 
AbbAAblbbb bbbbAAbbbb 


5460 




GTGACGGTGA 


APAPPPTPAP 


PPPPPPPTAP 


ATPPPPAPPP 
ft. 1 lAibbAbbu 


APATPPTPAA WyPATYTW 

AbAlbblbAA bubbAlbbbb 


5520 




CAGGACGTGC 


TPPAPAAPAT 


PATPPPPAPP 


ATTPPPATPP 
Al IbbbAlbb 


bibbbblbbb lAbbbbbbAb 


5580 




papatppppt 


PPATPPTPPP 


PTPPPTPPPP 


Hfj^n app Ar*T 
bbbbAbbAbl 


CGGGCTTCAC CACCGGTGCC 


5640 




PAPTTPAPPT 


PPA APPPPPP 


PPTPPAPATP 

bblbbAbAlb 


bbblbAbbbb 


CGCGGCTCCA TGCCCACCTG 


5700 




PPTPPPPATP 


PAPPPPPPPA 


APPAPPPPAP 
AbbAbbbbAb 


CTCTGCGAGG 


GTGCGGCCTG CAAGGCTGAG 


5760 




blA/ibblbbb 


rrrrrTrrrr 
bbbbblbbbb 


ppp a rnnn a 
bbbAbbbbAb 


GTGCCGAAGC 


ACCAAAAGGC CGCGCATTGC 


5820 




pppppptttt 

bLAibLL Jill 


pptttptppa 
bbl 1 lblbbA 


lbbblbbbbA 


CGGGTGCCGC 


GTCAGGCAGG GCAGGGCCCC 


5880 




ppphtpttp a 

tuuUX 1 1 bn 


PTPPAPPATP 

b 1 bbAbbA i b 


rrcp ap atp a 
bbbbAbAlbA 


AGTACTTGAT 


CAGCCCCTTG GCCGCGAAGC 


5940 




PPAPPATPPP 


PA APPPPAPr 
unALaAA/Abb 


rrr M~^r \ apa 
bbbAbbAAbA 


GCACGAAGGT 


GCCGAACTTG CCGGCCTTCG 


6000 




APTPPPPPPP 
rtL 1 LljUbbbb 


P APPTP A A AP 

bAbblbAAAb 


ATP ATP A ATT 

AlbAlbAAlb 


CCATGTAGAG 


CATGMGGCC GTGACGCCGA 


6060 




LbblbAbbbb 


PAPPTP/V^PA 

bAbblljbbbA 


ATGTTTTCCT 


CGTTGATTTC 


GAACATCGTT TGTTGTCTCA 


6120 




GGCTGCTGCA 


CGCGGCTGAC 


GTGCTCGCCG 


CGCGGCCGGG 


CCCCAACTGC CCGCAGCGGT 


6180 




TCTCGATCAG 


GTTCTCAAGG 


CATCTCGTGC 


CACTGGGAGG 


TGTCCACCAG GTCGCGGTAG 


6240 




GCGTGCCAGC 


TCGAATGCGC 


CAGCCACGGC 


ACTACCACGA 


TCAGGCCCAG CAGCAGCGTG 


6300 




GCCATGCCCA 


GCAGCGTCAG 


CGCCATGATC 


AGCGCCGCCC 


ACAGCGCCAG CGGCAGTGGG 


6360 


20 


TGCTGCATCA 


CCACGCGCCA 


GCTCGTGAGC 


ACCGCCACCA 


GCACGCCCAC GTGGCGGTCC 


6420 




AGCAGCATCG 


GGATCC 








6436 



(2) INFORMATION FOR SEQ ID NO. : 2: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH : 1 161 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULAR TYPE : oligonucleotide 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 2: 





ATGTCGGGCC 


TGAACCTGCC 


GATGCAGGCC 


ATGACCAAGC 


TGCAGGGCGA 


GTACCTCAAC 


60 




GAGGCGACGG 


CGCTGTGGA^ 


CCAGACGCTG 


GGGCGCCTGC 


AGCCCGACGG 


CAGCGCCCAA 


120 


5 


CCGGCCAAGC 


TGGGCGACCG 


GCGCTTCTCG 


GCCGAGGACT 


GGGCCAAGAA 


CCCCGCCGCG 


180 




GCCTACCTGG 


CGCAGGTCTA 


CCTGCTCAAT 


GCCCGCACGC 


TGATGCAGAT 


GGCCGAGTCC 


240 




ATCGAGGGCG 


ACGCCA<\GGC 


CAAGGCGCGC 


GTGCGCTTCG 


CCGTGCAGCA 


GTGGATCGAC 


300 




GCCGCGGCGC 


CGAGCACTT 


CCTGGCGCTC 


A4TCCCGAGG 


CGCAGCGCAA 


GGCGCTGGAG 


360 




ACCAAGGGGG 


AGAGCATCAG 


CCAGGGCCTG 


CAGCAGCTGT 


GGCATGACAT 


CCAGCAGGGC 


420 


10 


CACGTGTCGC 


AGACGGACGA 


GAGCGTGTTC 


GAGGTGGGCA 


AGAACGTCGC 


CACCACCGAG 


480 




GGCGCGGTCG 


TGTACGAGAA 


CGACCTGTTC 


CAGCTCATCG 


AGTACAAGCC 


GCTGACGCCC 


540 




AAGGTGCACG 


AGMGCCGAT 


GCTGTTCGTG 


CCGCCGTGCA 


TCAACAAGTA 


CTACATCCTG 


600 




GACCTGCAGC 


CGGACAACAG 


CCTCATCCGC 


TACACCGTCG 


CCCAGGGCCA 


CCGGGTGTTC 


660 




GTGGTGAGCT 


GGCGCA4CCC 


CGACGCCTCC 


GTCGCCGGCA 


AGACCTGGGA 


CGACTACGTG 


720 


15 


GAGCAGGGCG 


TGATCCGCGC 


CATCCGCGTG 


ATGCAGCAGA 


TCACGGGGCA 


CGAGAAGGTC 


780 




AACGCGCTGG 


GCTTCTGCGT 


CGGCGGCACC 


ATCCTGAGCA 


CGGCGCTGGC 


GGTGCTGGCC 


840 




GCGCGCGGCG 


AGCAGCCCGC 


GGCGAGCCTG 


ACGCTGCTGA 


CCACGCTGCT 


GGACTTCAGC 


900 




AACACCGGCG 


TGCTGGACCT 


GTTCATCGAC 


GAGGCCGGCG 


TGCGCCTGCG 


CGAGATGACC 


960 




ATCGGCGAGA 


AGGCGCCCM 


CGGCCCGGGC 


CTGCTCAACG 


GCAAGGAGCT 


GGCCACCACC 


1020 


20 


TTCAGCTTCC 


TGCGCCCGAA 


CGACCTGGTC 


TGGA<\CTACG 


TGGTGGGCAA 


CTACCTCAAG 


1080 






CGCCGCCCTT 


CGACCTGCTG 


TACTGGAACT 


CCGACAGCAC 


CAACATGGCC 


1140 




GGGCCCATGT 


TCTGCTGGTA 


CCTGCGCAAC 


ACCTACCTGG 


AGAACAAGTT 


GCGCGTTCCC 


1200 




GGTGCCCTGA 


CCATCTGCGG 


CGAGAAGGTG 


GACCTCTCGC 


GCATCGAGGC 


GCCGGTGTAC 


1260 




TTCTACGGTT 


CGCGCGAGGA 


CCACATCGTG 


CCCTGGGAAT 


CGGCCTACGC 


CGGCACGGAG 


1320 


25 


ATGCTGAGCG 


GCCCCA4GCG 


CTATGTCCTG 


GGTGCGTCTG 


GCCACATCGC 


CGGCGTGATC 


1380 




AACCCCCCGC 


AGAAGA^AA 


GCGCAGCTAC 


TGGACCAACG 


AGCAGCTCGA 


CGGCGACTTC 


1440 




A^CCAGTGGC 


TGGA4GGCTC 


CACCGAGCAT 


CCTGGCAGCT 


GGTGGACCGA 


CTGGAGCGAC 


1500 




TGGCTCAAGC 


AGCACGCGGG 


CA4GGA4ATC 


GCCGCACCCA 


AGACTCCCGG 


CAACAAGACC 


1560 




CACAAGCCCA 


TCGAGCCCGC 


CCCCGGGCGT 


TACGTGAAGC 


AG.AAGGCCTG 


A 


1611 



6 
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(2) INFORMATION FOR SEQ ID NO. : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1 179 base pairs 

(B) TYPE : nucleic acid 

5 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULAR TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. :3: 

10 ATGACCGACA TCGTCATCGT CGCCGCAGCC CGCACCGCCG TGGGCAAGTT CGGCGGCACG 60 

CTGGCCAAGA CCCCCGCTCC GGAGCTGGGC GCCGTGGTCA TCAAGGCCCT GCTGGAGAAG 120 

ACGGGCGTCA AGCCCGACCA GATCGGTGAA GTCATCATGG GCCAGGTGCT GGCCGCCGGC 180 

GCGGGCCAGA ACCCCGCGCG CCAGGCGATG ATGAAGGCGG GCATCGCCAA GGAAACGCCG 240 

GCGCTGACCA TCAACGCCGT GTGCGGCTCC GGCCTCAAGG CCGTGATGCT GGCCGCCCAG 300 

15 GCCATCGCCT GGGGCGACAG CGACATCGTC ATCGCCGGCG GCCAGGAGAA CATGAGCGCC 360 

AGCCCGCACG TGCTGATGGG CAGCCGCGAC GGCCAGCGCA TGGGCGACTG GAAGATGGTC 420 

GACACCATGA TCAACGACGG CCTGTGGGAC GTGTACAACA AGTACCACAT GGGCATCACG 480 

GCCGAGAACG TCGCCAAGGA ACACGACATC AGCCGCGACC AGCAGGACGC CCTGGCCCTG 540 

GCCAGCCAGC AGAAGGCCAC CGCCGCGCAG GAAGCCGGCC GCTTCAAGGA CGAGATCGTT 600 

20 CCGGTCTCGA TCCCGCAGCG CAAGGGCGAC CCGGTGCTGT TCGACACCGA CGAGTTCATC 660 

AACAAGAAGA CCACCGCCGA AGCGCTGGCG GGCCTGCGCC CGGCCTTCGA CAAGGCCGGC 720 

AGCGTGACCG CGGGCAACGC CTCGGGCATC AACGACGGCG CCGCTGCGGT GATGGTGATG 780 

TCCGCCGCCA AGGCGAAGGA GCTGGGCCTG ACGCCCATGG CGCGCATCAA GAGCTTCGGC 840 

ACCAGCGGCC TGGATCCGGC CACCATGGGC ATGGGCCCGG TGCCGGCCTC GCGCAAGGCG 900 

25 CTGGAGCGCG CCGGCTGGCA GGTCGGTGAC GTGGACCTGT TCGAGCTCAA CGAAGCCTTC 960 

GCCGCCCAGG CCTGCGCGGT GAACAAGGAG CTGGGCGTGG ATCCGGCCAA GGTCAACGTC 1020 

AACGGCGGTG CCATCGCCAT CGGCCACCCC ATCGGCGCCT CCGGCTGCCG CGTGCTGGTG 1080 

ACGCTGCTGC ACGAGATGCA GCGCCGGGAC GCCAAGAAGG GCCTGGCCGC GCTGTGCATC 1140 

GGCGGCGGCA TGGGCGTGTC GCTGACCGTC GAGCGCTGA 1179 
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(2) INFORMATION FOR SEQ ID NO. : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 738 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULAR TYPE : oligonucleotide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 4: 



ATGGCACAGA 


.AACTGGCTTA 


CGTGACCGGC 


GGCATGGGCG 


GCATCGGCAC CTCGATGTGC 


60 


CAGCGCCTGC 


ACAAGGACGG 


CTTCAAGGTG 


ATCGCCGGCT 


GCGGTCCGAG CCGCGACCAC 


120 


CAGAAGTGGA 


TCGATGAACA 


GGCCGCGCTG 


GGCTATACCT 


TCTACGCCTC CGTGGGCAAC 


180 


GTGGCCGACT 


GGGACTCCAC 


CGTGGCCGCC 


TTCGAGAAGG 


TCAAGGCCGA GCACGGCACC 


240 


GTGGACGTGC 


TGGTGAACAA 


CGCCGGCATC 


ACGCGTGACG 


GGCAGTTCCG CAAGATGAGC 


300 


AAGGCCGATT 


GGCAGGCCGT 


GATGTCGACC 


AACCTCGACA 


GCATGTTCAA CGTCACCMG 


360 


CAGGTGATCG 


AGGGCATGGT 


GGACAAGGGC 


TGGGGCCGGA 


TCATCAACAT CTCCTCGGTC 


420 


AACGGCGAGA 


AGGGCCAGTT 


CGGCCAGACC 


AACTACTCCG 


CCGCCAAGGC CGGCATGCAC 


480 


GGCTTCTCGA 


TGGCGCTGGC 


GCAGGAAGTG 


GCGGCCAAGG 


GCGTGACGGT GAACACCGTG 


540 


AGCCCGGGCT 


ACATCGCCAC 


GGACATGGTC 


AAGGCCATCC 


GCCAGGACGT GCTGGACAAG 


600 


ATCATCGCCA 


CCATTCCCAT 


CCGTCGCCTG 


GGTACGCCGG 


AGGAGATCGC CTCCATCGTC 


660 


GCCTGGCTGG 


CCGGCGAGGA 


GTCGGGCTTC 


ACCACCGGTG 


CCGACTTCAG CTGCAACGGC 


720 


GGCCTGCACA 


TGGGCTGA 








738 



(2) INFORMATION FOR SEQ ID NO. : 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 536 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
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(ii) MOLECULAR TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 5: 



Asn Leu 
5 



5 Met Ser Gl 

Glu Tyr Leu Asn Glu Ala Thr Ala Leu 

20 25 



Thr Lys Leu 



Leu Gin Pro Asp Gly Ser 

35 

Phe Ser Ala Glu Asp Trp 



Gin Val Tyr Leu 
65 

15 He Glu Gly Asp 



Gin Trp 



He Asp 
100 



Leu Asn 
70 

Ala Lys 
85 

Ala Ala 



Glu Ala Gin Arg Lys Ala 
In Leu Trp 



Gly Leu Gl 
130 

Thr Asp Glu Ser 
145 

25 Gly Ala Val Val 



Val Phe 
150 
Tyr Glu 
165 



He Arg 
210 



Thr Pro Lys Val 
180 

Asn Lys Tyr Tyr 
195 

Tyr Thr Val Ala 



Ala Met 

10 

Trp Asn Gin Thr Leu 
30 

Ala Lys Leu Gly Asp 



Gin Gly 
15 

Gly Arg 

Arg Arg 



Pro Ala Ala Ala Tyr Leu Ala 



Ala Gin Pro 
40 

A 1 a Lys Asn 

55 60 

Ala Arg Thr Leu Met Gin Met Ala Glu Ser 

75 80 

Ala Lys Ala 



Arg Val 
90 



Ala Pro Ser 
105 

Leu Glu Thr 
120 

His Asp He 
135 

Glu Val Gly 



Lys Asn 
155 
Phe Gin 
170 



His Glu Lys 
185 

He Leu Asp 
200 

Gin Gly His 
215 



Arg Phe Ala Val Gin 
95 

Leu Ala Leu Asn Pro 
110 

Glu Ser He Ser Gin 
125 

Gly His Val Ser Gin 
140 

Val Ala Thr Thr Glu 



Leu Phe Val 
190 

Pro Asp Asn 
205 

Phe Val Val 
220 



Tyr Lys 
175 

Pro Pro 



Ser Leu 
Ser Trp 
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Arg 
225 
Glu 



Asn Pro Asp Ala 



Gin Gly 

5 His Glu Lys 

Ser Thr Ala 
275 

Ser Leu Thr 
10 290 

Leu Asp Leu 
305 

He Gly Glu 

15 Leu Ala Thr 

Tyr Val Val 
355 

Leu Leu Tyr 
20 370 

Cys Trp Tyr 
385 

Gly Ala Leu 

25 Ala Pro Val 

Glu Ser Ala 
435 

Val Leu Gly 
30 450 

Lys Lys Lys 
465 



Val He 
245 
Va 1 Asn 
260 

Leu Ala 



Ser Val 
230 

Arg Ala 



Lys Ala 
325 

Thr Phe 
340 

Gly Asn 

Trp Asn 

Leu Arg 

Thr He 
405 
Tyr Phe 
420 

Tyr Ala 



Thr Thr 
295 

Asp Glu 
310 

Pro Asn 

Ser Phe 

Tyr Leu 

Ser Asp 
375 

Asn Thr 

390 

Cys Gly 

Tyr G 1 y 

Gly Thr 

Gly His 
455 
Tyr Tr p 
470 



Ala Gly Lys 

He Arg Val 
250 

Gly Phe Cys 
265 

Ala Ala Arg 
280 

Leu Leu Asp 

Ala Gly Val 

Gly Pro Gly 
330 

Leu Arg Pro 
345 

Lys Gly Glu 
360 

Ser Thr Asn 

Tyr Leu Glu 

Glu Lys Val 
410 

Ser Arg Glu 
425 

Gin Met Leu 
440 

He Ala Gly 

Thr Asn Glu 



Thr Trp 
235 

Met Gin 

Val Gly 

Gly Glu 

Phe Ser 
300 

Arg Leu 
315 



Met Ala 
380 

Asn Lys 
395 

Asp Leu 

Asp His 

Ser Gly 

Val He 
460 
Gin Leu 



Gly Thr 
270 

Gin Pro 
285 

Asn Thr 

Arg Glu 

Asn Gly 

Leu Val 
350 

Pro Pro 
365 

Gly Pro 

Leu Arg 

Ser Arg 

He Val 
430 
Pro Lys 
445 

Asn Pro 



Tyr Val 
240 
Thr Gly 
255 

lie Leu 



475 



Gly Val 

Met Thr 
320 

Lys Glu 
335 

Trp Asn 

Phe Asp 

Met Phe 

Val Pro 
400 

He Glu 
415 

Pro Trp 

Arg Tyr 

Pro Gin 

Asp Phe 
480 
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Asn Gin Trp Leu Glu Gly Ser Thr Glu His Pro Gly Ser Trp Tip Thr 

485 490 495 

Asp Trp Ser Asp Trp Leu Lys Gin His Ala Gly Lys Glu He Ala Ala 

500 505 510 

Pro Lys Thr Pro Gly Asn Lys Thr His Lys Pro He Glu Pro Ala Pro 

515 520 525 

Gly Arg Tyr Val Lys Gin Lys Ala 
530 535 536 



(2) INFORMATION FOR SEQ ID NO. : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 392 amino acids 

(B) TYPE : amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULAR TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 6 : 

20 Met Thr Asp He Val lie Val Ala Ala Ala Arg Thr Ala Val Gly Lys 
5 10 15 

Phe Gly Gly Thr Leu Ala Lys Thr Pro Ala Pro Glu Leu Gly Ala Val 

20 25 30 

Val He Lys Ala Leu Leu Glu Lys Thr Gly Val Lys Pro Asp Gin He 
25 35 40 45 

Gly Glu Val lie Met Gly Gin. Val Leu Ala Ala Gly Ala Gly Gin Asn 

50 55 60 

Pro Ala Arg Gin Ala Met Met Lys Ala Gly He Ala Lys Glu Thr Pro 
65 70 75 80 

30 Ala Leu Thr He Asn Ala Val Cys Gly Ser Gly Leu Lys Ala Val Met 
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Leu Ala Ala Gin Ala He Ala Trp Gly Asp Ser Asp He Val He Ala 

100 105 110 

Gly Gly Gin Glu Asn Met Ser Ala Ser Pro His Val Leu Met Gly Ser 
115 120 125 

5 Arg Asp Gly Gin Arg Met Gly Asp Trp Lys Met Val Asp Thr Met He 
130 135 140 

Asn Asp Gly Leu Trp Asp Val Tyr Asn Lys Tyr His Met Gly He Thr 
145 150 155 160 

Ala Glu Asn Val Ala Lys Glu His Asp He Ser Arg Asp Gin Gin Asp 
10 165 170 175 

Ala Leu Ala Leu Ala Ser Gin Gin Lys Ala Thr Ala Ala Gin Glu Ala 

180 185 190 

Gly Arg Phe Lys Asp Glu He Val Pro Val Ser He Pro Gin Arg Lys 
195 200 205 

15 Gly Asp Pro Val Leu Phe Asp Thr Asp Glu Phe He Asn Lys Lys Thr 
210 215 220 

Thr Ala Glu Ala Leu Ala Gly Leu Arg Pro Ala Phe Asp Lys Ala Gly 
225 230 235 240 

Ser Val Thr Ala Gly Asn Ala Ser Gly He Asn Asp Gly Ala Ala Ala 
20 245 250 255 

Val Met Val Met Ser Ala Ala Lys Ala Lys Glu Leu Gly Leu Thr Pro 

260 265 270 

Met Ala Arg He Lys Ser Phe Gly Thr Ser Gly Leu Asp Pro Ala Thr 
275 280 285 

25 Met Gly Met Gly Pro Val Pro Ala Ser Arg Lys Ala Leu Glu Arg Ala 
290 295 300 

Gly Trp Gin Val Gly Asp Val Asp Leu Phe Glu Leu Asn Glu Ala Phe 
305 310 315 320 

Ala Ala Gin Ala Cys Ala Val Asn Lys Glu Leu Gly Val Asp Pro Ala 
30 325 330 335 

Lys Val Asn Val Asn Gly Gly Ala He Ala He Gly His Pro He Gly 
340 345 350 

12 
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Ala Ser Gly Cys Arg Val Leu Val Thr Leu Leu His Glu Met Gin Arg 

355 360 365 

Arg Asp Ala Lys Lys Gly Leu Ala Ala Leu Cys He Gly Gly Gly Met 

370 375 380 

5 Gly Val Ser Leu Thr Val Glu Arg 

385 390 392 



(2) INFORMATION FOR SEQ ID NO. : 7 
10 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 245 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

15 (ii) MOLECULAR TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. : 7 : 

Met Ala Gin Lys Leu Ala Tyr Val Thr Gly Gly Met Gly Gly lie Gly 
5 10 15 

20 Thr Ser Met Cys Gin Arg Leu His Lys Asp Gly Phe Lys Val lie Ala 
20 25 30 

Gly Cys Gly Pro Ser Arg Asp His Gin Lys Trp He Asp Glu Gin Ala 

35 40 45 

Ala Leu Gly Tyr Thr Phe Tyr Ala Ser Val Gly Asn Val Ala Asp Trp 
25 50 55 60 

Asp Ser Thr Val Ala Ala Phe Glu Lys Val Lys Ala Glu His Gly Thr 
65 70 75 80 

Val Asp Val Leu Val Asn Asn Ala Gly He Thr Arg Asp Gly Gin Phe 
85 90 95 

30 Arg Lys Met Ser Lys Ala Asp Trp Gin Ala Val Met Ser Thr Asn Leu 
100 105 110 
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Asp Ser Met Phe Asn Val Thr Lys Gin Val He Glu Gly Met Leu Asp 

115 120 125 

Lys Gly Trp Gly Arg He He Asn He Ser Ser Val Asn Gly Glu Lys 

130 135 140 

5 Gly Gin Phe Gly Gin Thr Asn Tyr Ser Ala Ala Lys Ala Gly Met His 

145 150 155 160 

Gly Phe Ser Met Ala Leu Ala Gin Glu Val Ala Ala Lys Gly Val Thr 

165 170 175 

Val Asn Thr Val Ser Pro Gly Tyr lie Ala Thr Asp Met Val Lys Ala 

10 180 185 190 

He Arg Gin Asp Val Leu Asp Lys He He Ala Thr He Pro He Arg 

195 200 205 

Arg Leu Gly Thr Pro Glu Glu lie Ala Ser He Val Ala Trp Leu Ala 

210 215 220 

15 Gly Glu Glu Ser Gly Phe Thr Thr Gly Ala Asp Phe Ser Cys Asn Gly 

225 230 235 240 

Gly Leu His Met Gly 
245 



(2) INFORMATION FOR SEQ ID NO. : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 315 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULAR TYPE : promoter gene 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO. 
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ACACCGCGCC GAGCAAGGTG CCGTTGGGCG CCATGGCTTC GGCCACGGCC ATCATCAGCA 60 

CCACGTAACA GCCATGCCAG AGCAACCAAG TACATAGCAA AAACCCGCAA TTACGCAGAA 120 

TGACGTATTT CGTACAATGA AAACTGTTGT CATGATGCGG TAAGACACGA AGCCTACAAC 180 

GCGATCCAGC AACGGTTTTC GTGAAAAAGT CCTCAGGAGA CGAGCGTGAC ACTGCAAATC 240 

5 CCATTCCCGC ACTGCAACAG CTTGGCGACA ACGCCACGGC GCTGAGTGCC GCCATCTGGG 300 

AACGTGCGCG CGATG 315 



15 
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